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Abstract 



o 
o 

£NJ ■ We discuss the definition of quantum probability in the context of "timeless" general-relativistic 

quantum mechanics. In particular, we study the probability of sequences of events, or multi-event 
probability. In conventional quantum mechanics this can be obtained by means of the "wave 
function collapse" algorithm. We first point out certain difficulties of some natural definitions of 
multi-event probability, including the conditional probability widely considered in the literature. 
We then observe that multi-event probability can be reduced to single-event probability, by taking 
into account the quantum nature of the measuring apparatus. In fact, by exploiting the von- 
Neumann freedom of moving the quantum/classical boundary, one can always trade a sequence 
^ • of non-commuting quantum measurements at different times, with an ensemble of simultaneous 

commuting measurements on the joint system+apparatus system. This observation permits a 
formulation of quantum theory based only on single-event probability, where the results of the 
"wave function collapse" algorithm can nevertheless be recovered. The discussion bears also on 
the nature of the quantum collapse. 

1 Introduction 

o 

The general-relativistic revolution of our understanding of space and time has proven extremely ef- 
V-h , fective empirically. Conventional textbook quantum mechanics (QM), and conventional quantum 

field theory (QFT), however, are formulated in a language which is incompatible with the general- 
relativistic notions of space and, especially, time. Is there a formulation of QM compatible with these 
notions? Such a formulation should be required, in particular, in order to provide a clear interpretative 
framework to any attempt to formulate a quantum theory of gravity in a form consistent with the 
& ' general-relativistic understanding of space and time pQ. 

The main difficulty in extending QM to the general-relativistic context comes from the notion of 
time. (Historical, if a bit out-of-date reviews are OH]. See also the recent 0] and In conven- 
tional QM (including QFT), the independent time variable t is interpreted as an observable parameter 
independent from the physical system considered. States and probability amplitudes evolve determin- 
istically in t. 1 This notion of time is not general-relativistic and conflicts with general covariance, 



* Unite mixte de recherche (UMR 6207) du CNRS et des Universites de Provence (Aix-Marseille I), de la Mediterranee 
(Aix-Marseille II) et du Sud (Toulon- Var); laboratoire affilie a la FRUMAM (FR 2291). 
x In QFT this is realized via the representation of the Poincare group in the state space. 
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and so, if we take general relativity seriously, it is likely to be unsuitable to describe the world at the 
Planck scale. 

In a sum-over- histories context, this problem is addressed by Hartle's generalized QM JBJ. Here 
we work in the hamiltonian context. We summarize below, in Section |2 a hamiltonian formulation 
of QM that does not make use of this nonrelativistic notion of time, following 0, and especially 
Chapter 5 of [Hj, to which we refer for full details and a complete discussion. Following |B], we call this 
formulation "general-relativistic" . In this formulation, the probability for observing a certain event s' 
if an event s was observed, is given by the modulus square of a suitably defined transition amplitude 
V s ^s' — \-A s => s ' | 2 = I (s'\s)n | 2 , as explained below. This probability postulate coincides with standard 
QM transition probabilities in the non-general-relativistic case, but is well-defined in a wider context, 
sufficiently general to form the basis for a timeless interpretation of a general-relativistic quantum 
theory. 

However, a key problem was left open in QM does not provide only probabilities for single 
observations. It also provides probabilities for sequences of observations. For instance, it provides the 
probability V^^^'^p" for a system in a state \ip) to be observed in a state \ip') and then in a state 
One way to compute this is to assume that at the time of the first measurement the state "collapses" 
to the state |^'). It is not clear how these probabilities for ensembles of events can be computed in 
general-relativistic QM. The difficulty comes from the following fact. As is well known, in QM, given 
a un-ordered ensemble of observations, its probability depends on the time order in which they are 
performed. Now, in a quantum general relativistic context, as we recall below, the notion of time- 
evolution, and in particular the notion of time-order, is subtle due to the absence of an external notion 
of time. How do we compute, then, the probability of sequences of measurements? The problem was 
discussed for instance by Hartle in 0. A strictly related problem is that the probability defined 
in |5] concerns only the non- degenerate eigenvalues of the (partial) observables. The probability of 
observing a degenerate eigenvalue was not defined. In this paper we study a possible solution to this 
problem. 

In Section we state the problem and we point out a certain number of difficulties that emerge 
in trying to assign a probability to sets of events in a general relativistic context. In particular, we 
discuss the difficulties of two apparently "natural" solutions. The first is a direct generalization of 
the single event probability postulate: the probability of an ensemble of events is determined by the 
projection on the physical Hilbert space of the subspace of the kinematical Hilbert space associated 
to this ensemble of events. We show that this postulate is not viable because it does not reduce to the 
standard QM probabilities in the nonrelativistic case. The second is the use of conditional probability, 
widely discussed in the literature. We point out certain difficulties with the operational definition of 
this probability. 

In Section^ we indicate a possible general way for solving the problem. This is based on the obser- 
vation that a multiple-event probability, such as V^^,^>^" can always be reinterpreted as a single-event 
probability, once the dynamics and the quantum nature of the apparatus making the measurements 
are taken into account. If we do so, the time order gets naturally coded into the dynamics of the 
system. This strategy provides a general way for dealing with multiple-event probabilities in general- 
relativistic quantum mechanics. 

Finally, in Section[5]we comment on the meaning of the notion of probability in the timeless case. 
In particular, we clarify the apparent difficulty presented by the fact that probabilities assigned to 
the possible values of a variable may not sum up to one. In Section El we summarize our results and 
discuss the issues that remain open. 

We illustrate the working of this technique in a companion paper jl(Jj , where we introduce a discrete 
model that allows us to exemplifies all the general structures introduced above. The example in 
illustrates a notable convergence between the strategy introduced here and Hartle's generalized QM 
approach 0. 
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2 General relativistic quantum mechanics 

We give a rapid overview of general relativistic QM, referring to Chapter 5 of [H] for a complete 
discussion and full details. 

2.1 General relativistic classical mechanics 

A classical mechanical system is described by a number of observables quantities, that we call partial 
observables. We include under this name the dependent as well as the independent variables in the 
equations of motion of non-relativistic mechanics. Examples of partial observables are the time t, 
the position of a particle A, the momentum, the energy, the spacetime coordinates x M in special- 
relativistic field theory, the four coordinates x M of a relativistic particle, and so on. We assume that 
each partial observable can be measured by a suitable physical apparatus. 

In a system with a finite number n of degrees of freedom, we choose n + 1 partial observables 
(typically the n lagrangian variables plus the time variable) , and form the n + 1 dimensional extended 
configuration space, or event space, C. The extended configuration space of a relativistic particle is 
the Minkowski space. The extended configuration space of a homogenous and isotropic cosmological 
model where a is the volume of the universe and <p lS the matter density, is coordinatized by a and 
4>. Points in C are called events and denoted s,s',s",.-- : for instance, a point in Minkowski space 
s — x^ = (x,t), or a given value of radius of the universe and matter density s = (a,<j)), define an 
event. Measuring a complete set of partial observables -that is, determining a point in C— is to detect 
the happening of a certain event. For instance: a particle is detected in a point of Minkowski space, a 
certain value of radius of the universe and average energy density are measured, and so on. Each such 
detection describes an interaction of the system with another system, playing the role of observer. 

Dynamics is then uniquely determined by a single function H, the relativistic hamiltonian 2 , on 
the cotangent space T*C. 3 This is a symplectic space, with a natural symplectic two-form to. The 
submanifold £ defined by H = is called the constraint surface. The integral lines of the restrictions 
of uj to £ are the physical motions of the system. The space of these motions is the phase space T. 
Thus, in this context the phase space T is interpreted, a la Lagrange, as the space of the solutions of 
the equations of motion 4 , or the space of the possible "motions" of the system, instead than a space 
of initial data. Notice that a physical motion determines a continuous sequence of events in C. 

The interest of this formulation of dynamics is that the time variable t is just one of the coordinates 
of C, on equal footing with the other partial observables, as required by general covariance. In a general 
covariant system, in fact, dynamics is not the description of how physical variables evolve in a preferred 
independent variable t, but rather the description of the physical relations between partial observables 
(see the discussion in [S], Sections 3.1 and 3.2.4). 

A special case is provided by the non-relativistic systems. In these systems, one of the coordinates 
of C is singled out to play a special role: the time variable t, thus C = Co x R, where Co is the 
conventional configuration space; and H has the form 

H = p t +H (1) 

where pt is the momentum conjugate to t and Ho is independent from p t . Ho is the standard non- 
relativistic hamiltonian. It is easy to see that in this case £ is isomorphic to 1 x T*Co and the 
motions are the integral lines of the hamiltonian flow of Ho on T^Co' that is, we recover conventional 



2 It is also called "super— hamiltonian" , "hamiltonian constraint" , or "scalar constraint" . 

3 In this paper, we do not consider the case of other gauge invariances, besides the reparametrization invariancc 
generated by H. 

4 More precisely, as the space of the equivalence classes of solutions, up to gauge. 
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hamiltonian mechanics. Our main interest, of course, is for systems where H does not have the form 
CJ. These are the systems for which conventional QM is insufficient , and general relativity is among 
(the oo-number of degrees of freedom version of) these systems. 



2.2 General relativistic quantum mechanics: basic structure 

General relativistic quantum mechanics is defined by a kinematical Hilbert space IC carrying an al- 
gebra of operators corresponding to the partial observables. The dynamics is given by a generalized 
projection operator P in IC. 

The relation with a classical system is as follows. A linear representation of the Poisson algebra 
of coordinates and momenta on T*C defines the kinematical Hilbert space IC. For instance, we may 
take a Schrodinger representation IC = L2[C\. The relativistic hamiltonian H defines a self-adjoint 
quantum operator (that we denote with the same symbol) H on IC. The kernel of H, formed by the 
(possibly generalized) states in IC satisfying the "Wheeler-DeWitt" equation 

Hip = (2) 

is called the physical state space H. P is the linear (self-adjoint) operator P : fC—^Ti, given by: 

PiP = I ' dn e- mH ip (3) 



(we put h = 1) and loosely called "the projector", since ip _L H Pip = 0. 5 For periodic systems 
the range of integration is the period, for the others it is the real line (recall that we are only working 
with reparametrization invariant systems). If zero is in the discrete (resp. continuous) spectrum of H, 
then H is a proper (resp. generalized) eigenspace of IC. On the linear space TL we consider the Hilbert 
structure 

(Ps'\P S ) H := (s'|P|s) , (4) 

which is well defined (on a dense subspace of K.) even when H is a generalized eigenspace; see for 
instance ^Sj and the discussion in Section 5.5.2 of 0. We also write (s'\s)-h = (Fs'\Ws)n- Remark 
that since in general P is not a true projector, (s'|P|s) may very well be different from (Ps'|Ps). In 
particular, this last quantity is in general divergent in the case in which H. is a generalized eigenspace. 

States in IC have a physical interpretation 6 , as follows. If \s) € K, is an eigenstate of a complete 
set of commuting partial observable (self-adjoint) operators, with eigenvalues (a, b,c...), then |s) is 
interpreted as describing the event in C with coordinates (a, b, a..). That is, it describes an interaction 
of the system with another physical system, in which the values (a, 6, c.) are realized. 

In a non-relativistic system, where Q holds, the Wheeler-DeWitt equation J2J) becomes the 
Schrodinger equation and P is strictly related to the unitary evolution operator U(t) = e~ lHat . 
As a simple example, consider a 2-state spin system with time-independent hamiltonian Hq. Here 
IC = C 2 ® LpEt] is formed by states of the form ips{t), S =T, I and spanned by the (generalized) basis 
\S,t), where ip s (t) = (S,t\ip), and 

dr e - lT(pt+Ha) \S,t) = j dr e~ iHoT \S,t + r). (5) 

P sends \S, t) into a solutions of the Schrodinger equation. The physical space H is given by these 
solutions and clearly 7i ~ C 2 (any initial state generates one and only one solution of the Schrodinger 



5 If K/H has finite dimension N then P = 7VP, with P the true projector from IC to Ti.. 
6 This is the distinguishing feature of the interpretation we are considering. 
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equation). A basis in Ti can be obtained by choosing a reference time to and defining \S) := P|5, to). 
It follows easily P|5, t) = C/ f (t - t Q )\S). Then notice that 

(S',t'\F\S,t) = (S , \e- iHo ^-^\S), (6) 
that is, the matrix elements of P are essentially the matrix elements of the unitary evolution operator. 

2.3 Single event probability 

The main interpretation postulate of general relativistic QM is the following (see The probability 
Vs^s' of observing the event s' if the event s was observed (we shall write V s =$. s ' simply as V s ' when 
there is no need of indicating the initial state) is given by the modulus square of the amplitude 

A s ^ s , = ( S '|P|s), (7) 

where the states are normalized in Ti., not in IC, that is 

(s|P|s) = <s'|P|s') = 1. (8) 

There are several equivalent ways of writing this probability. We can define the projection operator 

II S , = P\s') (s'|P = |Ps') (Ps'| (9) 
and write the probability as the expectation value 

= |^W| 2 = (s\U s ,\s), (10) 

where again |s) and \s'} are normalized by (JSJ. Alternatively, recalling the notation (s'\s)-h = 
(Ps'\Ps)-h ■= (s'\P\s), and explicitly including the normalization in the expression for the probability, 
we can write 

v = WV)n? = |(P.s'|P*)h| 2 nn 
^' (s'\s') H (s\s) n (Ps'|Ps')w <Ps|Ps)«- 1 ' 

Notice that this probability is a standard quantum mechanical probability computed in the physical 
Hilbert space Tt, in the following sense. The states |s) and \s') in IC "project" down to physical states 
\¥s) and \¥s'} in H. The probability is then simply the standard probability amplitude of 
measuring the physical state |Ps') if the physical state |Ps) was measured. In other words, HV is 
the projector onto |Ps'), in 7i, and it is a genuine Dirac operator. Therefore general relativistic QM 
simplifies, but does not contradict, the standard Dirac treatment of a constrained system. This fact, 
by the way, assures us that the amplitude ^ yields a probability less or equal to one. 

This formulation is very close in spirit to the conventional scattering formulation of QFT, where 
probabilities are defined by transition amplitudes between initial and final asymptotic states, and 
these are defined as timeless Heisenberg states, or full "histories" of the system: see for instance 
Weinberg's clear discussion in Chapter 3 of [TT] , 

This definition of probability reduces to the conventional one in the non-relativistic case. In the 
spin system considered above, for instance, the states PIS', t) are normalized and the amplitude for 
measuring the spin 5" at time t' if S at time t was measured is 

Ast^s'f = (S',t'\¥\S,t) = (S'\e- m ^'-^\S), (12) 

in agreemenet with conventional QM. More in general, in the case in which H — p t +Ho, the definitions 
above reduce to the standard QM postulates regarding states, observables and probability. 

However, there is one additional postulate required to define quantum theory: the collapse postu- 
late, stating that after a measurement, the state changes and becomes an eigenstate of the operator 
associated to the measurement. The translation of this postulate to the relativistic formalism is our 
concern here. 
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3 Multiple-event probability 



Consider a partial observable A in JC and let a be one of its eigenvalues. If a is non-degenerate, and 
\s') is the corresponding eigenstate, then J7J) provides the probability amplitude of measuring a. What 
happens if a is degenerate? 

Let us say for simplicity that a is doubly degenerate, and that \s') and \s") arc two orthogonal 
eigenstates having eigenvalue a, that is, they span the a-eigenspace JC a - Then, to measure the eigen- 
value a, or, equivalently, to measure its associated projection operator TT a = \s')(s'\ + \s")(s"\, means 
that we have a measuring apparatus that gives us a Yes answer if either the event s' or the event s" 
happen (Yes answer corresponds to the eigenvalue 1 of 7t a ). In order to compute the probability of 
having a Yes answer, we need therefore the probability Vs'ors" that the event s' OR the event s" 
happens. 

Alternatively, suppose that we have a measuring apparatus that gives us a Yes answer if both the 
event s' and the event s" happen. In order to compute the probability of having a Yes answer, we 
need therefore the probability V s > and s" that the event s' AND the event s" happen. The solution of 
one case gives immediately the solution of the other since, clearly 

Ps' ORs" =r P.S r ^Ts" ^s' AND s" ■ (13) 

There are two possibilities: either P S / ANDS " is always zero, or not. Consider the two cases separately. 



(i) Mutually exclusive events. If P s ^( s : Ams ») = for any s, then s' and s" are alternative 
events that cannot both happen. That is, if one happens, the probability that the other happens is 
zero. By Q and the given interpretation, this is equivalent to 

(s'\¥\s") = 0. (14) 

In this case, (fT3)) gives 

V a > ORa »=V^+V a ». (15) 

That is, the probability of s' OR s" is simply the sum of the probabilities of s' and s". Observe that 
this can be written generalizing (|10|l to 

V s ^ a = {s\U a \s}, (16) 

where II a is the orthogonal projector on the subspace Ha = PIC a in H, which, if 114(1 holds, is simply 
given by (remember we are requiring JHJ) 

n a = \¥s / )(Fs / \ + \¥s")(¥s"\. (17) 

A typical example is the following. In the two-state spin system considered in the previous section, 
let \s') = 1 1,*> and \s") = \i,t). In this case, (s'\F\s") = (| \U (0)| |) = 0. The two events are mutually 
exclusive. Therefore V s > and s" = 0. The projector on the a-eigenspace K, a is 

7r = |T,*)<T,t| + \i,t){i,t\ . (18) 

The projection 7i a of K a in TL is spanned by the two orthogonal states P |f , t) = W(t — to)\ | ) and 
¥\l,t) = U^(t-t )\i), therefore 

n a = u\t-to) (itXTI + ) u(t-t ). (19) 

In this two-state system, II a = 11 and the corresponding probability is V a = 1- Not so, of course, in 
general. 
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(ii) Non exclusive events. The interesting case is when 

(s'\F\s") ^ 0. (20) 

A typical example is the following. Let |s') = | ],t') and \s") = | <-,i") = IT ''" ) ^ i,t " ) . In this case, 
(s"|P|s') = ( *— \U(t" — t')\ |) 7^ 0, in general. The question we are asking is: what is the probability 
of detecting the spin f at time t' AND the spin <— at t"? The problem is of course well posed: if a 
particle is in a certain initial state at t, what is the probability of finding it with a certain spin at 
time t' AND with another spin at a later time t"? This can be measured by measuring the fraction 
of a beam that passes through a sequences of two Stern-Gerlach apparatuses. 

Now, in ordinary quantum mechanics, these probabilities depends on the time ordering between 
the events. For instance, let the initial state |s) be the state |— ►) = at time to, and let U(t) = 1 

for all t. Then 

*P s=>(V AND S") 

Because the sequence 

h) => 

has probability 1/4; while the sequence 

h> => 

cannot happen. The standard way of obtaining these probabilities in conventional quantum mechanics 
is via the projection postulate. For instance, say t' < t" , that is, case Q22fl. We have: (i) at time 
t' the spin j is measured with probability |(f | ^)| 2 = 1/2; (ii) the state is hence projected to |f); 
(Hi) at time t" the spin <— is measured with probability |(<— | |)| 2 = 1/2, giving total probability 
1/2x1/2 = 1/4. 

Standard QM gives also, easily 

^Ps^>(s ! OR s") 

Comparing with (|13fl . notice that the probabilities V s ^, s ' and V s ^ s " relevant here (with two detectors) 
are different from the probabilities V s ^s' and V s =?s" relevant when only one detector is present. For 
instance, in the first case, we have T s ^ s " = \ {— ► | T)(T I ^)| 2 + I (—* I I) (I I O | 2 = |j because of the 
presence of a detector in s'; while in the absence of this, we would clearly have V S ^ S " = |(— *• | ^)| 2 = 0. 
This well known fact will play an important role below. 

How do we recover these probabilities in relativistic QM, where we do not have a notion of time 
ordering in tl 

3.1 Two false tracks 

3.1.1 Taking (I1C>[) as the general probability postulate 

Let us suppose we ignore for the moment the time-ordering problem, and we try to directly associate 
a probability to the a-eigenspace JC a , as we did in the case (i) (mutually exclusive events). Notice 
that the definition (|16(l of probability remains meaningful also in the case (ii) (non mutually exclusive 
events). It is therefore very tempting to suppose that the probability is still given by (jlfijl also in this 
case. 

In the example above, for instance, the eigenspace K, a is spanned by the two events considered 
(which are orthogonal in K, because they are at different time), and the projection operator on JC a is 

n a = \tt')($,t'\ + \^,t")hX\- (25) 



IT) 



if t' < t" , 
if t" < t' . 



IT) 



(21) 
(22) 
(23) 



if a < t", 
if t" < f. 



(24) 
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The projection Ti a of JC a to Ti is spanned by the two states P| |, t') and P| «— , i") . This time these two 
states are not orthogonal in H. We can nevertheless still consider the possibility that the probability 
V a is given by (|16|) . where n a is the projection operator on the space Ti a they span. 

The probability postulate (|16|) is not of completely straightforward utilization in the case in which 
the orthogonal eigenstates of 7r a are projected to non-orthogonal states in H., namely in the case (I20II . 
because in this case Ijl7(l is not true. This is a technical difficulty that can be addressed by standard 
linear algebra methods, for instance via an orthogonalization procedure. A more powerful technique 
is to observe that the projector on the linear space spanned by a set of possibly linearly dependent 
states S — llu 1 )..-!"^)} can be written in the form 

Ha = £ |u<> Gy <u*| (26) 

ij 

where the sum is over any maximal subset of linear independent sates in S and the matrix Gij is the 
inverse of their Gramm matrix G lJ = {u l \u J ). 

Unfortunately, however, there is a serious problem: this probability postulate is wrong because 
it does not reduce to the conventional and well tested probabilities of nonrelativistic QM. This can 
be seen from the following example. Consider the spin case mentioned above. The two kinematical 
states | a') and \s") are orthogonal in JC (because they are at different times). If we project them 
down to H, the two resulting states |Ps') and |Ps") are not anymore orthogonal, but they are still 
linearly independent. Therefore they still span a two-dimensional space. Since the space they span is 
Ha, it follows that 7i a is two-dimensional. But so is H in this example. Hence H a = T~t- It follows 
immediately that II a is the identity operator in Tt, and therefore that i|16|) states that the probability 
of measuring either s' or s" is always equal to unity. This is in contradiction with the correct result 
H24fl given by nonrelativistic QM. 

Therefore the probability formula 1)16(1 is not correct in the case (ii) of compatible events. The 
difficulty appears to be in the fact that the formalism ignores that s' happens before s". 



3.1.2 Conditional probabilities 

The idea that in the timeless case the interpretation of QM can be entirely based on conditional 
probabilities has been suggested in ^2] an d is very attractive. The idea has been widely discussed 
and also criticized, see for instance [2], but has recently received new attention. For instance, in [JJ], 
C. Dolby has provided an interesting and convincing reply to the criticisms in 0. We refer therefore 
here to Dolby's version of the conditional probability interpretation. For full references, see the two 
papers quoted. 

Dolby gives a postulate for the probability Via when b) that an event a happens together with (as 
the same time as) an event b. 7 The events a and b are described by commuting projector operators 
7r a and 7Tb in IC. Dolby's probability postulate is, in our notation (and not writing explicitly the 
dependence of the probability on the state), 

(s\¥n a Tr b P\ S ) 

?(flwhen ^ (s\Pn b P\s) ■ (27) 
Suppose that we have a set {a\, a n } of events such that 

^ = 1 - ( 28 ) 



7 Carefully distinguishing Via when b) from the conditional probability Via, if b). 
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It follows immediately from Dolby's definition that 



^ V(a n when b) = 1. 



(29) 



In order to illustrate the difficulty with this definition of probability, consider the two-state system 
introduced in Section 12.21 but let us imagine, for simplicity, that time is discrete. That is, the 
states are ips(t n ), S =|, I where ips(t) = (S,t n \ijj), with integer n (as in the example we will discuss 
in the companion paper jlO)). In the conventional formalism one focus on probabilities of the form 
■p(| when t n ), where the event (f, t n ) is considered as one element of the set of equal-time alternatives 
&t n = {(T) t n )(l, tn)}- But the general formalism does not privilege the time variable and therefore 
allows us to consider also probabilities of the form V(t n when |), where the event (t, t n ) is considered 
as one element of the set of alternatives <Sj = {..., (f, t n -i), (T> tn), (T> t n +\), ■■■}• Let us therefore study 
the interpretation of these. If we take 7Tf, = 7T|, the projector on the | eigenspace of the spin operator, 
and TT an — 7r tre , the projector on the t n eigenspace of the t operator in lj27JI . we have the probability 
V{t n when f) to find the particle at time t n "when" the spin is |. Let us calculate this probability for 
a state such that, in particular (assume the dynamics is such that this is a physical state) 



I, if n = l,2. 
0, otherwise. 



(30) 



There are two different times at which the spin is j": 
immediately 



t\ and t 2 - By symmetry and i|29|) . we have 



V(h when |) = V(t 2 when |) = 



f 



(31) 



At first sight, this looks reasonable: since the particle has spin up at two different times, if the spin is 
up, then there might be a 50/50 probability that it is one or the other of these two times. But can we 
give a more precise definition of this probability? That is, can we give a precise operational procedure 
for measuring this probability? Here is where we see a difficulty. Indeed, we see two possibilities for 
an operational interpretation of this probability, but neither appears to work in general. The following 
are the two possibilities. 

(i) There is a single detector at t = t n , that detects the spin f, and we interpret V{t\ when f) 
as (the limit of) the ratio of the number of detections over the number of trials. This is clearly not 
viable, because under the conditions given, this ratio is equal to one, not to 1/2: a detector which is 
on at time t\ will always detect the particle, not just half of the times. How is this probability equal 
to one accounted for in Dolby's scheme? 

(ii) We have one detector at each t n , each sensitive only to the spin |. We call N tn ^ the number 
of times the detector at t n clicks, and call iVf the total number of detections, and we interpret 
7 , (<i when f) as the limit of N tl ^/N^. This works, because at each trial both detectors at t\ and t 2 
click, so that we get the correct 1/2. 

However, (ii) does not work in general, because of the fact that, as noticed after l|24|l . the very 
presence of detectors alters quantum mechanical probabilities. For instance, suppose we have a system 
with four states \i), i = 1, 2, 3, 4, with a time dependent dynamics given by ip(t n+ i) = LKjn^ftn) where 



U(t )=U(t 1 



/ 1 \ 

10 

10 

y i o o o ) 



U{t 2 



/ 1 -1 \ 
11 
110 
-110 / 



(32) 



9 



and U(t n ) = 1 for any other n 



Say the initial state is ^(to) 



(|3) + |4))/V2. Then easily 



lp(tn>i) 



(|3) + |4))/A 
(|3) + |4))/A 
(|3) + |4))/V2. 



(|3) + |1))/A 
|2>, 



(33) 



Hence 



1 



(34) 



V2' 



and Dolby's probability gives 



V{t n when 



i = 1) = 5* 



(35) 



But, accordingly with quantum mechanics, if we have detectors in i = 1 for all t n , then at t\ the state 
is projected with probability 1/2 on |1), and with probability 1/2 on |3). In both cases, the detector 
at t$ will click with probability 1/2, as a straightforward calculation shows. Hence the probabilities 
are: 1/4 that t\ alone clicks, 1/4 that t$ alone clicks, 1/4 that t\ and £3 click, 1/4 that no detector 
click. The ratio N^JN^ is therefore 1/2, different from Dolby's probability. The point is of course 
that the very presence of the detectors changes the wave function at later times, and this is not taken 
into account by Dolby's expression. 

Thus, neither the operational interpretation (i), nor the operational interpretation (ii) work. What 
is then the meaning of this probability? We do not exclude that third consistent operational definitions 
of Dolby's probability could be given, but we do not yet see it. (On this problem, see |I2|.) Dolby 
appears to be aware of this difficulty: in 0, he writes "V(a when b) is best thought as representing the 
proportion of the physical path on which a and b are simultaneously true, divided by the proportion 
on which b is true" . This is clearly intuitively okay, but in strident contradiction with the fact that 
there is no way of measuring "physical paths" in quantum mechanics. The difficulty is that Dolby's 
probability is defined as if the detectors did not affect the quantum states; but this contradicts the 
very physical contents of quantum theory. 

For clarity, the difficulty we are raising is not that Dolby's scheme cannot take into account situa- 
tions with several measurements. Dolby defines also probabilities, such as V{a,\ when b\\a-i when 62) 
that do so. The point we are raising is that we do not understand how to interpret, that is, how to 
concretely measure, Dolby's probabilities when the b variable in Ij27(l is not conventional time. 

Thus, in spite of the attractiveness of the conditional probability interpretation, we find that its 
foundation is not solid. Therefore we do not see, at present, how to take it as a general basis for 
relativistic quantum mechanics. The present work can be seen, to a large extent, as an attempt to 
ameliorate the conditional probability interpretation, but putting it on firmer grounds. 

4 Multi-event probability from the coupling of an apparatus 
4.1 The basic idea 

It is perhaps the very central physical content of quantum theory that certain questions cannot be 
combined by AND and OR, even if we can do so in classical physics. For instance, we can meaningfully 
ask if an electron is in a certain region of space, and we can meaningfully ask if an electron has a 
certain energy. But asking if an electron "is in a region AND has a certain energy" , makes no sense in 
quantum theory, beyond a certain precision. Position and energy are represented by non-commuting 
operators in the quantum theory, and we can assign an electron a definite position and a definite 
energy, but not both. Bohr has emphasized that this is related to the fact that electron manifests 
its position in a certain context of interaction and it manifests its energy in a different context, but 
there is no interaction context in which the two can be manifested together. Position and energy are 
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incompatible observables that cannot be measured together, and therefore it is perfectly okay that we 
cannot assign them joint probabilities. 

At the light of this well known basic consideration, let us return to the case of the measurement of 
the s' and s" events, in the case (ii) of Section considered above. For concreteness, let us consider 
again the example in which s' represent the spin being | at a time t' and s" represents the spin 
being «— at a later time t" . Since in the physical state space the two states \Vs') and |Ps") are not 
orthogonal, they cannot be interpreted as (non-degenerate) eigenstates of commuting operators. We 
are led to the tentative idea that they represent incompatible observables, to which we cannot assign 
joint probabilities. After all, it is hard to say that two events at different times can happen together! 

At first sight, this sounds as an elegant solution of the problem. But at a moment of reflection, 
something is clearly missing: Indeed, it does make sense to ask the experimental question of what is 
the probability for a spin system to have spin | at time t' AND spin <— at a later time t" . This is a 
statement that gets a precise meaning in an appropriate measurement context. In conventional QM, 
it can be dealt with by separating the two measurements in time, and using the collapse algorithm 
to compute joint probabilities. Can we obtain the same probabilities without invoking the collapse 
postulate? As well known, the answer is yes, and follows from an analysis of the experimental situation 
involved in the experiment in which we measure the two spins at different times. The key is to bring 
the apparatus that makes the measurement into the picture. 

4.2 Multi— event probability from the single— event one in nonrelativistic 
QM 

Let us see how this works in conventional QM. Consider an initial state at time t. What is the 
probability of detecting the state \ip') at time t' AND the state \ifi") at time t" > t'l This can be 
computed by projecting on \ip') at time t', which gives 

= \(ip"\U(t" - t')n 4 ,,U(t' - t)\^}\ 2 . (36) 

where H^i = \ip')(ifj'\ and all states are normalized = 1. Now, describe the apparatus measuring 

ip 1 as a two-state system which is initially in a state \No), which interacts with the system at the time 
t' , jumping to the state \Yes) if and only if the state of the system is in the state ip' . The interacting 
dynamics is then given by the unitary evolution operator U^^t' defined by 

{^',A"\U^ >V {H' - t)\1>,A) = (ip"\U(t" - t)\i>) 5 A „ A (37) 

for t and t" both larger or smaller than the interaction time t' , and 

{i>",A"\U^ iV {t" - t)\i>,A) = {^"\U{t" - t') (n r l A »A + (1 - rV)<W)^(i' - t)\i/>) (38) 

if t" > t' > i, where 

£aa'=(J J) (39) 

is a matrix that flips the apparatus state. Then the question we are interested in can be rephrased 
as follows: "Given an initial state \ip,No) at time t, what is the probability of measuring the state 
\tp",Yes) at time t"T' Notice that "the system state is lip"}" and "the apparatus state is \Yes)" , are 
compatible statements in quantum theory: they refer to orthogonal observables, both at time t" , that 
are, and can be, measured together! In other words, the question can be captured by the single-event 
probability amplitude 

7V>^« = \(ip",Yes\U^ tt ,(t" -t)\iP,No)\ 2 . (40) 

The same observation can also be illustrated as follows. The probability of a time-ordered sequence 
of events, such as (|36ll . can be written in the form 

P^'V-" = MIV^IV) (41) 
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where the history operator 

IV,,/,» = U(t - t')n^U(t' - t")IL^U(t" - t')Tl^U{t' - t) (42) 

is not a projector. But this same probability can be written as the expectation value of a projection 
operator if we enlarge the state space to include the apparatus. In fact, (|40(l gives 

Pv^v'^"' = (^,No\% fiill ,\i),No) (43) 

where the operator 

n^',^" = U^, t , Hili» t Yes Ujji'f (44) 

is a projector. In fact, it projects on the state Ul, t , , Yes). That is, the probability of a sequence 
of events can still be obtained from the basic probability postulate 

4.3 Multi— event probability from the single— event one in relativistic QM 

Let us translate the above observation in the language of general relativistic quantum mechanics. An 
apparatus is now a two-state system which is initially in a state \No), and interacts with the system 
at the event s', jumping to the state |Yes) if and only if the event s' happens. The question "Given 
an event s, what is the probability of detecting an event s' AND an event s"T'' can be rephrased as 
follows: "Given an event \s,No), what is the probability of detecting the event \s" ,Yes)T' In other 
words, the question can be captured by a single-event probability amplitude 

A s ^.(s'AND s") = A-s,No^s",Yes = (s" ,Yes\V s ,\s, No) , (45) 

provided that the dynamical operator Py takes appropriately into account the coupling between 
system and apparatus at the event s' . 

It is clear that this strategy works in general, for arbitrary sequences of measurements. Sequences 
of incompatible measurements can always be reinterpreted as compatible measurements of apparatuses 
that have interacted with the system. In fact, it is clear that the only operational meaning that we 
can ascribe to the probability that something happen at time t' AND something happen at a different 
time t" regards, in reality, the probability of a simultaneous check of records of (one at least of) these 
two events. Using this strategy, any probability for sequences of observations can be reduced to a 
probability for eigenvalues of commuting observables, and therefore be reduced to the single event 
probability 0. 

Where is the information about time-ordering gone? It has been coded into the specification 
of the interaction between the system and the apparatus, namely in a new projection operator P 
that includes the interaction with the apparatus. The fact that the probability of a sequence of 
events depends on the time ordering of the measurements can be coded into the specification of the 
dynamics of the physical interaction between the system and the apparatus. This observation allows 
us to completely reduce multi-event probabilities to single-event probability, and therefore to obtain 
all relevant quantum mechanical probabilities from the single probability postulate J7J. 

The reader might object that we have only shifted the difficulty from the problem of defining the 
general formalism of quantum theory to the problem of constructing a suitable dynamical operator 
Py capable of capturing the abstract idea of a "detection at the event s'" . We think this objection is 
ill-founded, for the following reason. In dealing with physics the problem is not to describe abstract 
ideas, but to describe what we do concretely, and to give predictions about concrete experimental 
situations. Each concrete experimental situation has to be described by a specific dynamics, and 
therefore by a specific P s / operator. The question is not whether or not a concrete system-apparatus 
interaction describes or not an abstract idea of measurement: the question is to find a formalism 
capable of describing and predicting any concrete physical situation. The problem of writing the 
correct dynamics describing the situation at hand is a concrete problem in the application of QM, not 
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in the definition of its general structure. The general theory does not say what happens at different 
times: for every physical situation it gives the probability distribution for all the events, including 
those that we may wish to view as records of previous events. 

4.4 A simple model 

Let us illustrate this idea in the simple spin system model we have used so far. This shows how P is 
modified by the presence of the apparatus. Recall that here JC = C 2 ®L[M.} spanned by a (generalized) 
basis of states \S, £),S=f,L Then H = C 2 and, choosing the reference time t = 0, F\S,t) = W(t)\S). 
We couple an apparatus with state space C 2 , spanned by the states \A), A = Yes, No. The kinematical 
Hilbert space becomes K. = C 2 <g) C 2 <£> L[R] spanned by a (generalized) basis of states \S, A,t). The 
modified dynamics, that includes an appropriate interaction is given, say, by the operator Pj t < defined 
as 

(S",A",t"\F w \S,A,t) = U S " S (t"-t) <W (46) 
if t and t" are both smaller or larger than t' (the interaction time); and 

(S",A",t"\F n ,\S,A,t) = J2 Us>i s \t" - t') (4, X A „ A + S l s , 5a«a) U s > s (t' ~ «) (47) 

if t < if < t". Notice that if and only if the spin is up, the apparatus flips state at time t' . That is, 
the dynamics defined by P| t / is the one that describes precisely an apparatus measuring f at time t' . 
Equivalently, we can write (|47J) posing (always for t < t' < t") 

F n > =F®l app + Pn ti ,P®I (48) 
where (A\l app \A') = 6 A a> and (A\1\A'} =1 A a> is 

Iaa. = (49) 

which emphasize the fact that P| t / is equal to P plus an interaction term at s' . 

We can now ask what is the probability of detecting, say, the state \s') — |<— , Yes, t") in the K. 
space of the coupled (system+apparatus) system, given that the state \s) = \— >, No,t) was detected. 
Here s' and s are events of the coupled (system+apparatus) system. Then J7J and l|47() give the 
correct result obtained from conventional QM. For instance, if, say, U(t) = 1 (cf. (|22(l ). we obtain 
f s ^(s' ands") = 1/4 if t < t' < t" and V s ^ s i A nds") = if t < t" < t' , to be compared with (|2"T|l . 

An example of application of this idea to a system genuinely without unitary time evolution will 
be given in the companion paper |l(Jj . 

5 The meaning of probability 

Before concluding, we discuss here a common objection to the assignment of probabilities to individual 
events, without reference to equal time surfaces. 

A probability is meaningful only if it assigned to an event s out of a set of alternatives S = 
{s\,S2, •••}• In non-relativistic QM, the probability of finding a particle in a certain spatial region R 
is usually understood as the probability of finding the particle in this region out of the alternatives 
given by the possibility of finding the particle in other spatial regions at the same time. That is, if 
s = (R, t), then S = St = {(R n j t)} where the R n are an ensemble of regions that cover an equal time 
surface. In a general rclativistic context, there is nothing that singles out the equal time surface. Does 
this imply that the probability V s has no meaning in a general relativistic context? 
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The answer is no, for the following reason. There exists an alternative interpretation of the 
probability V s that does not require the set of alternatives St — {(R n ,t)}. Consider a detector which 
is active in the space region R at time t. This detector has a finite probability of detecting the particle 
and a finite probability of not detecting the particle. These two alternatives define a simpler set 

Sduai = {Detection, Non detection}. (50) 

We can then interpret V s as the probability that detector detects the particle, out of the two alterna- 
tives (jHOJl- This does not require the equal time surface St to play any role. 

In fact, a moment of reflection will convince the reader that that this is what we truly mean by 
V s in any realistic quantum mechanical measurement. If the set of alternatives St was the relevant 
one, any position measurement would only be consistent if, at the same time t, there were detectors 
all over the universe, all measuring whether the particle is there! This is not what we do when we 
measure if the particle is in a certain region at a certain time. What we do is to have a detector only 
in the region of concern and interpret the case of non detection as implying that the particle would 
have been detected by one detector elsewhere. We can do so, because in ordinary non-relativistic 
quantum mechanics, we have the remarkable property that 



E 



?W) = !• (51) 



But this is a specific property of the dynamics of a particle, not a condition for Vr^j to be defined. 
Vn n ,t is defined by itself, and the probability normalization condition is simply 

7~*(Ii,t) 7~*Non detection — Detection T^Non detection 1- (52) 

Let us express the same idea in more mathematical terms. Let \x)t be an eigenvectors of the 
Heisenberg position operator X{t), that is X(t)\x)t — x\x)t- The state \x)t is, of course, also an 
eigenvector of the proposition operator 

Pxt = \x)tt{x\. (53) 
The two operators are related by the spectral decomposition 

X(t)= [ dxx P xt (54) 



in nonrelativistic QM, but they are independently defined and they have each a physical interpretation. 
The operator X(t) describes an ensemble of detectors covering the entire space and measuring where 
is the particle at time t. Its outcome is a real number x. The operator P xt describes a single detector 
that detects whether or not the particle is in x at time t. Its outcome is a single bit: either YES or 
NO. Now the relation (|54|l implies 151fl : if this relation is not available, the proposition operator P xt 
is still well defined, and still defines consistent probabilities for this detector outcomes. 



6 Perspectives 

The main result of this work is to show that the single probability postulate J7J) of general relativistic 
quantum mechanics is capable of giving all probabilities of conventional quantum mechanics, including 
the probabilities for sequences of events, which are usually computed by means of the projection 
algorithm. This is achieved by exploiting the freedom of moving the quantum/classical boundary, 
emphasized by von Neumann, and assuming that (in the non-relativistic case) the evolution of the 
system+apparatus is always unitary. 
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A number of points deserve to be better understood. In particular: (i) The use of this formalism in 
less trivial systems, where the complications connected to continuous spectrum operators and infinite 
volume gauge groups are more severe, (ii) The possibility of associating probabilities to arbitrary 
continuous regions of C ^3 E| • This is related to the well known "time of arrival" problem (see for 
instance |17| and references therein), (Hi) The extension of these ideas to field theory, and in particular 
the connection between this formalism and the boundary formalism [HI which is presently used 
to compute probability amplitudes in background independent quantum gravity, in the context 
of loop quantum gravity |51 12UI I^T] . (iv) Eventually, we would like to apply this formalism to physical 
situations where the the assumption of the existence of a background geometry breaks down (for 
instance, see [22] )■ We leave these issues for further developments. 

A number of tentative considerations following from the present result can nevertheless be at- 
tempted. 

(i) Time ordering does not appear to be a fundamental structure required for the definition of 
quantum theory and the calculation of its probability amplitudes. In our opinion, this reinforces the 
hypothesis that the fundamental theory of nature can be formulated in a timeless language [22| , and 
that temporal phenomena could be emergent |24| . 

(ii) In the generally covariant context, dynamics can be entirely expressed in terms of Dirac 
observables. Indeed, notice that the probability of a sequence of measurements can be written as in 
equation (|10|l . namely as the expectation value of the projection operator IT defined in ©, or 1)440. 
This operator is a Dirac observable of the extended system that includes the measuring devices. 

In the present context, this is the answer to the long-standing problem of the description of 
dynamics in the "frozen-time" formalism; namely in the Dirac's quantization of a system whose 
dynamics is expressed by constraints 0E|- Dynamics is coded into (non-commuting) Dirac observables 
defined in terms of sets of interactions between (what we call) the system and (what we call) the 
measuring devices. 

(Hi) The discussion above bears also, indirectly, on the discussion on the interpretation of quantum 
mechanics, and on the nature of the quantum collapse. 

In some interpretations of quantum mechanics, the wave function is considered a real entity that 
evolves unitarily, except at measurement time, when it undergoes a sudden change. In particular, 
some interpretations make the hypothesis that this "collapse" is a real physical phenomenon whose 
peculiar nonlocal dynamics is not yet understood. If this is the case, the full freedom of moving the 
quantum/classical boundary is broken, because once the collapse has happened no more interference 
between the two "branches" of a measurement outcome is possible, even in principle. If this is the 
case, the strategy adopted here is not viable in general, because it assumes, instead, that no true 
physical collapse happens at anytime. 

In some others interpretations, the wave function, or the "quantum state" , is not considered as 
a real entity. Rather, only quantum "events" are considered real, and probabilities like |(s'|s)| 2 are 
directly interpreted as conditional probabilities for these events to happen. In particular, in |2f>j these 
quantum events are assumed to happen at interactions between systems, and to be real only with 
respect to the interacting systems themselves. From this perspective, there is no specific physical 
phenomenon corresponding to a quantum collapse, and the strategy considered here is viable. With 
respect to an external system, what happens at the interaction between system and apparatus is not 
a sudden change in a hypothetical real "state" , but simply an entanglement between the probabilities 
of various outcomes of observations on the system or the apparatus. We refer to [2H], and Section 5.6 
of [Hj for a discussion of this point of view. 

(iv) To our knowledge, the only complete general covariant formalism for quantum theory alter- 
native to the one we have discussed here is Hartle's generalized quantum mechanics [Hj. We find an 
interesting convergence between Hartle's covariant sum over histories and our results. Within Hartle's 
generalized quantum mechanics, probabilities for sequences of events can be expressed by means of 
history operators. What we have argued here is that the mean value of a history operator can be 
re-expressed as the mean value of a conventional projection operator on the joint system-|- apparatus 
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Hilbert space. This relations will be better discussed and illustrated in the companion paper 

(v) Finally, in our opinion the result presented here reinforce the idea that quantum mechanics 
admits a very simple and straightforward generalization which is fully consistent with general relativity. 
And therefore that the contradictions between quantum theory and general relativity might be only 
apparent. 
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